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ABSTRACT 

We point out a strong time-evolution of the mass-to-light conversion factor rj 
commonly used to estimate masses of dense star clusters from observed cluster 
radii and stellar velocity dispersions. We use a gas-dynamical model coupled with 
the Cambridge stellar evolution tracks to compute line-of-sight velocity disper- 
sions and half-light radii weighted by the luminosity Stars at birth are assumed 
to follow the Salpeter mass function in the range [0. 15-17 M Q ]. We find that 
rj, and hence the estimated cluster mass, increases by factors as large as 3 over 
time-scales of 20 million years. Increasing the upper mass limit to 50 M leads 
to a sharp rise of similar amplitude but in as little as 10 million years. Fitting 
truncated isothermal (Michie-King) models to the projected light profile leads 
to over-estimates of the concentration parameter c of 5c « 0.3 compared to the 
same functional fit applied to the projected mass density. 

Subject headings: stellar dynamics - stellar evolution - NGC4038 - M82 - R136 
- numerical method 



1 Introduction 

The formation of star clusters in bursts of star formation during galactic mergers has 
attracted much attention since the ground-breaking study by Schweizer (1986). Young clus- 
ters can account for ~ 20% of the UV light flux of their sample starburst galaxies (compared 
with < 1% for the Milky Way; e.g. Meurer et al. 1995). Such numbers bring to focus 
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the role that star formation in clusters plays in shaping the overall (galactic) stellar mass 
function. Proto-typical cases where cluster formation has been a spectacular manifestation 
of interaction- induced starbursts are the merging systems NGC 4038/39 (the Antenna?) and 
the nearby galaxy M82 (recent interaction with M81). Many of the brightest clusters in these 
galaxies have estimated ages on the order of 10 7 years, based on optical and near-IR spec- 
tra. High resolution spectroscopic data and HST images have been used to measure velocity 
dispersions and estimate virial masses. The line-of-sight velocity dispersion oxd (LOSVD) 
relates to the mass M and projected half-light radius r ph of a cluster in virial equilibrium 
through 



where rj is a dimensionless free parameter. A number of authors have set t] « 10 in their 
studies to derive M from (1) (Sternberg 1998; Mengel et al. 2002; Smith & Gallagher 2001; 
McCrady et al. 2003; Maraston et al. 2004). McCrady et al give a derivation of this 
value of rj. Mengel et al. (2002) quote a range from 5.6 — 9.7 for King (1966) models with 
concentration parameter in the range 0.5 to 2.5, which corresponds to most galactic globular 
clusters. We discuss the value of rj further below. Dynamical masses derived from (1) have 
been compared to the stellar masses of synthetic populations, using both standard (field) and 
non-standard stellar initial mass functions (henceforth IMF; see Kroupa 2002 for a review). 
The data were found to be inconsistent with a universal IMF (Mengel et al. 2002; Smith & 
Gallagher 2001). In particular, several clusters in M82 were found to be over-luminous with 
respect to their estimated mass (low mass-to-light ratio, M/L*). This suggests that M82 
clusters may form with a top-heavy stellar IMF (Smith & Gallagher 2001; McCrady et al. 
2003). 

These conclusions hinge on the precise value of r] . The above studies have assumed no 
time-variation of 7], based on the belief that the structural parameters of young clusters are 
unlikely to have changed since the time they were born. A further implicit assumption made 
when applying (1) is that the stellar subpopulations sampled are all equally representative 
of the dynamics as a whole. This simplification is normally justified on the grounds that the 
estimated ages of the clusters are too short to allow for gradients in their spatial distribution 
and kinematics to develop from internal evolution. However this train of thought stems from 
the derivation of a long relaxation time t r h for single-population clusters. Expressed in terms 
of the dynamical time t cr evaluated at the half-mass radius Rh (Meylan & Heggie 1997, §7) 
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where R g = GM 2 /\W\ is the gravitational radius and the ratio Rh/R g ~ unity for a wide 
number of model fits to observed clusters. With a spectrum of masses, the trend toward 
equipartition of kinetic energy speeds up evolution, and mass segregation now develops on 
a time-scale given by (Farouki & Salpeter 1982; Spitzer 1987) 

t ms _ 7t (m) ~p 
t r h 3 max{m} p 

where p = M/2/(47ri?^/3) is the mean density inside the half- mass radius (an over-line 
denotes averaging over space, and brackets averaging by mass). The numerical coefficients 
entering (3) are derived from a stellar IMF and mass distribution. The Galactic-field IMF 
covers a range from m 0.08 M to > 60 M Q (O-stars) and possibly all the way to ~ 100 M 
(Kroupa 2002). The mean mass for this IMF is (m) « 1.33 M and therefore the ratio 
(m)/max{m} lies in the range 0.013 - 0.022 for the chosen upper limits. This dramatically 
reduces the mass-segregation time-scale (3) and bears on the parameter rj (through time- 
variation of r p h and er l£ j) since the brightest, most massive member stars are also those that 
undergo the most significant segregation and inward migration. The purpose of this Letter 
is to show with numerical modelling that, through this process, the mass conversion factor 
r\ of massive clusters evolves by a factor of a few over periods as short as 20 millions years, 
contrary to expectations of zero evolution. 

2 Numerical method : GasTel 

The equations of motion were integrated numerically based on the gas-dynamical approach 
pioneered by Larson (1970) and developed further by Louis & Spurzem (1991) to include 
anisotropic velocity fields. The method leans on an analogy between exchange of kinetic 
energy through star-star interactions and the classical heat-diffusion process of fluid dynamics 
(Lynden-Bell & Eggleton 1980; Heggie & Ramamani 1989). The implementation in spherical 
coordinates we used is largely due to Louis & Spurzem (1991) and Giersz & Spurzem (1994) 
but extended to include a spectrum of stellar masses (Spurzem & Takahashi 1995). 

The mass spectrum is sampled at constant logarithmic increments in the interval {mo, mi}; 
we used 14 bins in our standard runs (Sham ~ 0.329). Star-star interactions (including those 
between stars of the same mass bin) lead to the diffusion of kinetic energy. Roughly speaking, 
the resulting change in the velocity dispersion of each mass bin causes a readjustment of the 
density profile. This is obtained using a semi-implicit Henyey integration. The gravitational 
potential is then updated by applying Poisson's equation (see Louis & Spurzem 1991). 
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We checked that the mass-segregation time (3) obtained for GasTel is in quantitative 
agreement with N-body and Fokker-Planck integrators by computing the evolution of Plum- 
mer models with three species of stars, (cf. Spitzer & Shull 1975, Fig. 1). Spurzem & 
Takahashi (1995) report excellent agreement with (3) from their two-component test calcu- 
lations. 



2.1 Stellar evolution 

The different mass components are evolved according to the Cambridge tracks (Pols et 
al. 1998; Hurley et al. 2000). The tracks are efficiently coded in the form of fitting functions 
of the kind first presented by Eggleton, Fitchett & Tout (1989). The functions return the 
current bolometric luminosity, radius, mass and metal abundance for given time and initial 
metal abundance z Q ; in this contribution we set z Q = 0.02 (solar abundance) throughout. 

A filter can be applied to the bolometric luminosity from model stellar atmospheres, 
and the total flux in a specified waveband read from the Basel stellar library (Lejeune et 
al. 1998). We have mapped the stellar luminosity near the strongest near-IR CO bandhead 
(A ~ 2.2 - 2.29fim) and the Call triplet ( A ~ 8200 - 8600A). 

3 Flux-weighted scheme & stellar IMF 

Low gradients in metal abundances in young clusters can be interpreted to imply a 
spatial distribution of stars independent of their mass at the time the clusters formed (e.g. 
Suntzeff 1993). All the calculations presented here have no built-in segregation initially. 

The flux-weighted LOSVD is obtained, first by averaging the square velocity dispersion 
within a projected radius R for a given stellar mass ; then by averaging over all stellar 
masses, using as statistical weight the total luminosity A^ A (m) of mass component m over a 
wavelength interval 5X centered on A. The mean square velocity of stars of mass m, a^, is 
computed from their surface density E m 



The dispersions cr^ are summed over all masses using the luminosity as statistical weight, 
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where n(m) is the number of stars in the logarithmic mass interval In m, In m + 5 lnm. The 
normalisation constant W\ is the total light flux at wavelength A from all stars at time t. 
Note that n(m) in (4) refers to the IMF. All time- dependencies are encapsulated in A^ A (m[t]) 
and a 2 m . 

For the mass range 1 — 60 M Q (up to O-stars), the single-star IMF is well represented 
by a Salpeter (1955) power-law, n(m)dln m oc m" 2,35 dm. We set the mass range {m ,mi} 
to cover two decades, from 0.15 to 17 M (spectral type M5 to B5). Note that solar 
neighbourhood data favour an IMF that flattens out below 1 M Q (Scalo 1986; Kroupa et 
al. 1993), so our choice of an Salpeter IMF may seem artificial. However for the chosen 
mass range, we compute (m)/mi ~ 0.025, a ratio nearly identical to that obtained from the 
Galactic-field IMF. Thus our choice of mass function and mass range yields a conservative 
mass-segregation timescale compared with other IMF's when the latter are extended beyond 
60 Mp,. 



4 Results 

The runtime of our calculations was set to 50 Myrs. All models were scaled to N = 
500, 000 member stars and a total mass of M ~ 2 1O 5 M . We vary the dynamical evolution 
time between runs by constructing models with different sizes and central densities, while 
keeping the total mass, and N, constant. Mengel et al. (2002) fit the light profile of young 
Antennas clusters with King models of dimension-less parameter ty/a 2 = 6 to 9. We first 
setup three models with identical \J//cx 2 = 6 King parameter but each with different half-light 
radius (Table 1). The model labeled 'M82-F' has a mean surface density ~ 1.5 10 4 M Q /pc 2 
or half the value we derive for that cluster from the data of Smith & Gallagher (2001). 
The densest model is labeled 'R136', in reference to the 30 Doradus cluster (central volume 
density ~ lO 6 M /pc 3 ; Brandl et al. 1996). Finally, a third low-density model was evolved 
for comparison (labeled 'Low'). 

We start with the densest model 'R136' for which we compute the shortest relaxation 
time t r h- Since M,r p h and (a 2 ) are all known from the simulations, we solve for 77 directly 
from (1). The run of rj in time is displayed on Fig. 1(a). Three curves are shown, correspond- 
ing to bolometric light and filters centered on the CO continuum and the Call triplet. The 
rapid increase of rj from an initial value ~ 8.2 is striking. After 15 Myrs of evolution, 77 has 
more than doubled. The projected half-light radius r p h is displayed alongside rj. In all the 
cases r p h decreases steadily in time, a direct result of the migration of massive stars toward 
the centre. The general trend and quantities are not sensitive to the waveband adopted, for 
both t] and r p h. Note the slow but systematic rise of 77 after « 35 Myrs, when rj > 20 (factor 
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> 2.5 from its initial value; Fig. l[a]). 

The LOSVD changes relatively little over time in comparison: we measure a monotonic 
decrease of (a\) from ps 10.4 to 9.3 km/s (or, -10.6%) for the system as a whole, although 
for individual components evolution was more significant: down « 30% for the most massive 
stars, while the lightest component enjoys an increase of a comparable magnitude. These 
effects can all be traced back to the dynamical mass segregation. 

The time-evolution of rj for all three models is displayed on Fig. 1(b); we have plotted 
only the results for the bolometric light for clarity. Both rj and r p h for the 'Low' model 
stay essentially constant throughout. However the case 'M82-F' (initial surface density > 
10M Q /pc 2 ) shows unmistakable evolution, suggesting that for clusters of such or higher mean 
surface density we may no longer presume a time-independent mass-to-light parameter 77. 

5 Discussion and future work 

When the projected density of massive star clusters exceeds a few xlO 4 M /pc 2 , the 
mass-to-light conversion factor rj used to derive the mass in equation (1) may increase by 
a factor as large as 3 over time-scales of a few xlO 7 years (cf. Fig. l[b] and Table 1). 
Clusters with low surface density have long mass-segregation time-scales and constant rj. 
When we compare clusters with similar mean surface density ~ 41O 4 M /pc 2 but different 
central density initially, the results indicate that the increase in 77 is an increasing function 
of the initial central density (Table 1). Thus the drift of heavy stars toward the centre 
can considerably bias mass estimates of clusters centrally peaked at birth. The resolved 
cluster R136 (NGC 2070) in the 30 Doradus complex is a case in point: its central volume 
density may be as high as ~ lO 6 M /pc 3 with an estimated spherical half-mass radius ~ 1 
pc (Brandl et al. 1996). Several clusters in the Antennas or M82 have estimated masses 
~ 1O 6 M and a projected half-light radius r ph ~ a few pc's. Smith & Gallagher (2001) derive 
M = 1.2 10 6 M Q and r p j L « 2.8 pc for the cluster M82-F. The mean surface density of this 
cluster ~ 2.41O 4 M /pc 2 inside r p h, comparable to our model 'M82-F' ; for comparison, our 
model 'R136' had a mean density twice as high. All calculations were done for a membership 
of N = 500, 000, a factor 2 lower than rich starburst clusters. This may serve to accelerate 
evolution compared with actual massive clusters (cf. Eq. [2]). We recall that our choice of 
an upper mass mi < 20 M yields a mass segregation timescale which is a factor of 3 longer 
than if we had included O-stars. A re-run of the case 'M82-F' with a mass spectrum widened 
to 50 M Q yielded a rapid increase of rj over a shorter, 10 Myrs-timescale (open dots, Fig. 
l[b]). Therefore the evolution of rj through mass segregation of stars is a sensitive function 
of both the IMF and the cluster surface density profiles. We mention that the cluster M82-F 
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has an age of ~ 60 ± 20 Myrs, by which time the increase of r\ is maximum. This larger value 
of t] would go some way toward solving the apparent over-luminosity of young M82 clusters 
with respect to a standard IMF. Recent work on M82-F has highlighted the possibility of 
strong mass segregation in that cluster (McCrady et al. 2004). 

The strong evolution of rj for model 'R136' (Fig. l[a]) suggested to us to compare 
functional fits to the total surface density and luminosity profiles. We performed least- 
square fits (Press et al. 1992) using King profiles on the simulation grid out to 5r ph . Doing 
this for model 'R136' after 35 Myrs of evolution we found a best fit to the radial luminosity 
profile (cf. §3) that gave a King parameter ^ /a 2 ~ 7.6 (compared with ~ 6 for the run 
of density). The concentration parameter c = loglrj/rol increases by ~ 0.45 as \I//cr 2 runs 
from 6 to 7.6. Since the truncation radius r t remains constant, the central region seemingly 
shrinks by a factor « 2.8 compared with the surface density. Table 1 gives values of (77) and 
(Xph) averaged over the last 25 Myrs of evolution for this and two other models, which also 
show increased concentration. The effect can be cast in the context of a whole population 
of clusters as detected in interacting galaxies such as the Antennae (NGC 4038/9) or M82. 
Our modelling indicates that the clusters should be yet more massive than estimated until 
now, and less concentrated. Therefore their binding energy per unit mass will be less than 
deduced from profiling the light. However it has not escaped our attention that fitting the 
light flux would require a convolution with the point-spread function of an instrument to 
allow for direct comparisons with observations. More detailed models will be presented in a 
follow-up study (Fleck et al., in preparation). 
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Fig. 1. — Parameter rj and half-light radius r p h versus time. The initial models were unseg- 
regated King \1/ jo 2 = 6 models, a) results using filters at three different wavebands (see text 
for details); b) results for models with different initial surface density (cf. Table 1). When 
the mass range is increased from 17 M & to 50M Q r\ rises sharply over the first 10 Myrs (open 
circles, case 'M82-F'); thereafter it rejoins the curve displayed. 
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Table 1: Parameters of the models. All models have a total mass M = 2 x 1O 5 M . We set 
Rh = Rg/2 and p = p in (3) to average t ms . The subscript denotes t = 0; hats denote 
time-averages over the last 25 Myrs of evolution. The last column gives the increase of 
concentration parameter 5c for a few cases (§§5.1, 5.2). 

Name Model t ms r phfi f^ h E (0) £ o (v0 Vo V Sc 

m/a 2 Myrs pc pc 10 5 M Q /pc 2 10 5 M & /pc 2 



'R136' 6 9.3 1.01 0.51 ± 0.03 0.885 0.417 8.15 18.5 ±1 0.29 

'M82-F' 6 15.9 1.46 1.02 ± 0.06 0.474 0.224 8.15 12.6 ±0.7 

Low 6 47.9 3.05 2.84 ± 0.05 0.109 0.051 8.15 8.54 ±0.06 



'Plummer' 4.5 7.5 1.05 0.62 ± 0.03 0.606 0.410 8.20 16 ±1 0.23 

9 25.7 0.95 0.49 ± 0.02 5.232 0.440 7.76 19 ±4 0.29 



